18 research outputs found

Comparing the hierarchy of keywords in on-line news portals

Author: A Clauset
A Trusina
AL Barabási
B Corominas-Murtra
B Corominas-Murtra
C Cattuto
C Cattuto
C Goessmann
CV Damme
D Czégel
D Pumain
David Sousa-Rodrigues
DW McShea
E Mones
E Ravasz
ET Wimberley
F Floeck
FJ Brandenburg
G Ghosal
G Palla
G Tibély
G Tibély
Gergely Palla
Gergely Tibély
H Fushing
H Hirata
HW Ma
J Wickens
JI Perotti
K Juszczyszyn
L Lu
M Batty
M Fattore
M Kaiser
M Nagy
M Nagy
N Eldredge
P Heymann
P Mika
P Pollner
P Spyns
Peter Csermely
PR Krugman
Péter Pollner
R Guimerà
R Lambiotte
S Valverde
SN Dorogovtsev
V Zlatić
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016

The tagging of on-line content with informative keywords is a widespread phenomenon, from scientific article repositories through blogs to on-line news portals. In most cases, the tags on a given item are free words chosen independently by the authors. Therefore, the relations among keywords in a collection of news items are unknown. However, in most cases the topics and concepts described by these keywords form a latent hierarchy, with the more general topics and categories at the top and the more specialised ones at the bottom. Here we apply a recent, co-occurrence-based tag hierarchy extraction method to sets of keywords obtained from four different on-line news portals. The resulting hierarchies show substantial differences not just in the topics rendered as important (being at the top of the hierarchy) or of less interest (categorised low in the hierarchy), but also in the underlying network structure. This reveals discrepancies between the plausible keyword association frameworks in the studied news portals.

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

ELTE Digital Institutional Repository (EDIT)

FigShare

Effects of time window size and placement on the structure of aggregated networks

Author: A Barrat
A Gautreau
A Nanavati
G Krings
G Miritello
G Tibély
HH Jo
J Candia
J Onnela
J Onnela
M Karsai
M Newman
M Newman
M Seshadri
MS Granovetter
P Holme
P Holme
R Lambiotte
S Fortunato
V Blondel
W Aiello
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

Complex networks are often constructed by aggregating empirical data over time, such that a link represents the existence of interactions between the endpoint nodes and the link weight represents the intensity of such interactions within the aggregation time window. The resulting networks are then often considered static. More often than not, the aggregation time window is dictated by the availability of data, and the effects of its length on the resulting networks are rarely considered. Here, we address this question by studying the structural features of networks emerging from aggregating empirical data over different time intervals, focussing on networks derived from time-stamped, anonymized mobile telephone call records. Our results show that short aggregation intervals yield networks where strong links associated with dense clusters dominate; the seeds of such clusters or communities are already visible for intervals of around one week. The degree and weight distributions are seen to become stationary at around a few days and a few weeks, respectively. An aggregation interval of around 30 days yields the most stable networks, judged by the similarity of consecutive windows. For longer intervals, the effects of weak or random links become increasingly strong, and the average degree of the network keeps growing even for intervals up to 180 days. The placement of the time window is also seen to affect the outcome: for short windows, different behavioural patterns play a role during weekends and weekdays, and for longer windows it is seen that networks aggregated during holiday periods are significantly different. Comment: 19 pages, 11 figures
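The aggregation step described in this abstract can be illustrated with a minimal sketch. It assumes events are `(time, caller, callee)` tuples with time measured in days, uses the number of events as the link weight, and compares consecutive windows with a Jaccard index over link sets. The function names, the toy data, and the choice of similarity measure are illustrative assumptions, not the authors' method.

```python
from collections import Counter

def aggregate_window(events, start, length):
    """Count events per undirected link in the window [start, start + length)."""
    weights = Counter()
    for t, a, b in events:
        if start <= t < start + length and a != b:
            weights[frozenset((a, b))] += 1  # link weight = number of events
    return weights

def jaccard_links(w1, w2):
    """Jaccard index of the link sets of two aggregated networks (assumed measure)."""
    links1, links2 = set(w1), set(w2)
    union = links1 | links2
    return len(links1 & links2) / len(union) if union else 1.0

# Hypothetical toy data: (day, caller, callee)
events = [
    (0, "a", "b"), (1, "a", "b"), (2, "b", "c"),
    (8, "a", "b"), (9, "c", "d"),
]
week1 = aggregate_window(events, start=0, length=7)
week2 = aggregate_window(events, start=7, length=7)
similarity = jaccard_links(week1, week2)
```

Varying `length` and `start` in such a sketch is one way to explore how window size and placement change the aggregated network.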

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

Aaltodoc Publication Archive

DIAL UCLouvain

Identifying Overlapping and Hierarchical Thematic Structures in Networks of Scholarly Papers: A Comparison of Three Approaches

Author: A Clauset
A Clauset
A Friggeri
A Lancichinetti
A Lancichinetti
A Van Raan
Alexander Struck
B Ball
C Lee
C Lee
D Sullivan
F Havemann
F Havemann
F Janssens
F Janssens
F Radicchi
Frank Havemann
G Tibély
H Small
IV Marshakova
J Baumes
J Baumes
J Gläser
J Xie
Jochen Gläser
M Rosvall
M Sales-Pardo
M Zitt
Michael Heinz
O Amsterdamska
O Mitesser
R Klavans
Renaud Lambiotte
S Fortunato
S Ghosh
S Gregory
S Gregory
T Evans
V Blondel
W Zachary
X Wang
Y Ahn
Y Kim
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 26/07/2011

We implemented three recently proposed approaches to the identification of overlapping and hierarchical substructures in graphs and applied the corresponding algorithms to a network of 492 information-science papers coupled via their cited sources. The thematic substructures and overlaps produced by the three hierarchical cluster algorithms were compared to a content-based categorisation, which we based on the interpretation of titles and keywords. We defined sets of papers dealing with three topics located on different levels of aggregation: h-index, webometrics, and bibliometrics. We identified these topics with branches in the dendrograms produced by the three cluster algorithms and compared the overlapping topics they detected with one another and with the three pre-defined paper sets. We discuss the advantages and drawbacks of applying the three approaches to paper networks in research fields. Comment: 18 pages, 9 figures

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Community landscapes: an integrative approach to determine overlapping network module hierarchy, identify key nodes and predict network dynamics

Author: A Arenas
A Capocci
A Hinneburg
A Lancichinetti
A Lancichinetti
AK Ramani
C Baerveldt
D Ekman
D Krioukov
DJ Watts
DL Nelson
ER Gansner
F Radicchi
G Palla
G Tibély
H Yu
I Kovacs
I Vragovic
István A. Kovács
J Moody
JB Axelsen
JD Han
JM Kumpula
JM Thevelein
JP Bagrow
JP Eckmann
JW Berry
K Komurov
M Blatt
M Fiedler
M Girvan
M Grendar
M Rosvall
ME Newman
ME Newman
ML Clark
Máté S. Szalay
N Bertin
Olaf Sporns
P Csermely
P Pons
Peter Csermely
PM Kim
Robin Palotai
S Fortunato
S Fortunato
S Fortunato
T Nepusz
TS Evans
V Latora
VD Blondel
WW Zachary
Y-Y Ahn
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2010

Background: Network communities help the functional organization and evolution of complex networks. However, the development of a method that is both fast and accurate and that provides modular overlaps and partitions of a heterogeneous network has proven to be rather difficult. Methodology/Principal Findings: Here we introduce the novel concept of ModuLand, an integrative method family determining overlapping network modules as hills of an influence function-based, centrality-type community landscape, and including several widely used modularization methods as special cases. As various adaptations of the method family, we developed several algorithms, which provide an efficient analysis of weighted and directed networks, and (1) determine pervasively overlapping modules with high resolution; (2) uncover a detailed hierarchical network structure allowing an efficient, zoom-in analysis of large networks; (3) allow the determination of key network nodes; and (4) help to predict network dynamics. Conclusions/Significance: The concept opens a wide range of possibilities to develop new approaches and applications including network routing, classification, comparison and prediction. Comment: 25 pages with 6 figures and a Glossary + Supporting Information containing pseudo-codes of all algorithms used, 14 Figures, 5 Tables (with 18 module definitions, 129 different modularization methods, 13 module comparison methods) and 396 references. All algorithms can be downloaded from this web-site: http://www.linkgroup.hu/modules.ph

arXiv.org e-Print Archive

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

ELTE Digital Institutional Repository (EDIT)

A survey of results on mobile phone datasets analysis

Author: A Amini
A Bogomolov
A Bogomolov
A Bogomolov
A Clauset
A Kuusik
A Narayanan
A Noulas
A Stopczynski
A Wesolowski
AA Nanavati
AL Barabási
AL Barabási
AL Barabási
B Csáji
C Cortes
C Herrera-Yagüe
C Ratti
C Ratti
C Smith-Clarke
C Song
C Song
CA Hidalgo
CO Buckee
D Grady
D Lazer
D Liben-Nowell
D Naboulsi
D Quercia
D Wang
DJ Mir
DJ Watts
DJ Watts
E Carolan
E Ferrara
E Frias-Martinez
E Katz
ED Fitkov-Norris
EU
F Baccelli
F Calabrese
F Calabrese
F Calabrese
F Calabrese
F Calabrese
F Manfredini
F Peruani
F Simini
FHZ Xavier
FHZ Xavier
G Ghoshal
G Kossinets
G Krings
G Krings
G Krings
G Miritello
G Miritello
G Miritello
G Miritello
G Palla
G Ranjan
G Tibély
GK Zipf
H Mao
H Risselada
H Sterly
H Zang
H Zhang
H-H Jo
H-H Jo
H-H Jo
I Trestian
J Abello
J Candia
J Karikoski
J Karikoski
J McInerney
J Park
J Reades
J Reades
J Saramäki
J Steenbruggen
J Wiese
J-P Onnela
JE Blumenstock
JE Blumenstock
JE Blumenstock
JL Toole
JP Bagrow
JP Bagrow
JP Onnela
JP Onnela
JP Onnela
JP Onnela
K Dasgupta
K Kianmehr
K Yu
KS Xu
KS Xu
L Backstrom
L Gao
L Kovanen
L Kovanen
L Kovanen
L Sweeney
L Sweeney
L Tabourier
M Barthélemy
M Berlingerio
M Cebrian
M Karsai
M Karsai
M Karsai
M Karsai
M Kivelä
M Martino
M Nanni
M Pielot
M Rosvall
M Schläpfer
M Seshadri
M Tizzoni
M-X Li
MC González
MEJ Newman
MEJ Newman
MS Granovetter
N Aharony
N Du
N Eagle
N Eagle
N Eagle
N Eagle
N Eagle
N Eagle
O Bucicovschi
P Deville
P Expert
P Holme
P Wang
P Wang
P Wang
PJ Mucha
R Kwok
R Lambiotte
R Ling
R Trasarti
RD Malmgren
S Catanese
S Gambs
S Hill
S Isaacman
S Isaacman
S Isaacman
S Jiang
S Kirkpatrick
S Landau
S Motahari
SY Hung
T Aynaud
T Dierkes
T Louail
T Raeder
V Angelakis
V Blondel
V Blondel
V Frias-Martinez
V Frias-Martinez
V Frias-Martinez
V Frias-Martinez
V Palchykov
V Salnikov
V-P Backlund
VD Blondel
VD Blondel
W Aiello
X Lu
Y Altshuler
Y Kim
Y Kryvasheyeu
Y Richter
Y Song
Y Wu
YA Montjoye de
YY Ahn
YY Liu
Z Huang
Z Smoreda
Publication venue: 'Springer Science and Business Media LLC'

Crossref

THE EFFECT OF DISORDER ON THE HIERARCHICAL MODULARITY IN COMPLEX SYSTEMS

Author: Barabási A.-L.
D. NAGY
G. TIBÉLY
J. KERTÉSZ
Publication venue: 'World Scientific Pub Co Pte Lt'

Crossref

Comparing the Hierarchy of Keywords in On-Line News Portals

Author: A Clauset
A Trusina
AL Barabási
B Corominas-Murtra
B Corominas-Murtra
C Cattuto
C Cattuto
C Goessmann
CV Damme
D Czégel
D Pumain
David Sousa-Rodrigues
DW McShea
E Mones
E Ravasz
ET Wimberley
F Floeck
FJ Brandenburg
G Ghosal
G Palla
G Tibély
G Tibély
Gergely Palla
Gergely Tibély
H Fushing
H Hirata
HW Ma
J Wickens
JI Perotti
K Juszczyszyn
L Lu
M Batty
M Fattore
M Kaiser
M Nagy
M Nagy
N Eldredge
P Heymann
P Mika
P Pollner
P Spyns
Peter Csermely
PR Krugman
Péter Pollner
R Guimerà
R Lambiotte
S Valverde
SN Dorogovtsev
V Zlatić
Publication venue: 'Public Library of Science (PLoS)'

Crossref

An Evaluation of Community Detection Algorithms on Large-Scale Email Traffic

Author: A. Lancichinetti
B. Viswanath
G. Tibély
H. Almeida
J. Leskovec
M. Girvan
M. Newman
M. Rosvall
P. Ronhovde
R. Guimerà
S. Fortunato
S.E. Schaeffer
T. Evans
U. Brandes
Y.-Y. Ahn
Publication date: 01/01/2012

Community detection algorithms are widely used to study the structural properties of real-world networks. In this paper, we experimentally evaluate the qualitative performance of several community detection algorithms using large-scale email networks. The email networks were generated from real email traffic and contain both legitimate email (ham) and unsolicited email (spam). We compare the quality of the algorithms with respect to a number of structural quality functions and a logical quality measure, which assesses the ability of the algorithms to separate ham and spam emails by clustering them into distinct communities. Our study reveals that the algorithms that perform well with respect to structural quality don’t achieve high logical quality. We also show that algorithms with similar structural quality also have similar logical quality, regardless of their approach to clustering. Finally, we reveal that the algorithm that performs link community detection is more suitable for clustering email networks than the node-based approaches, and that it creates more distinct communities of ham and spam edges.

Crossref

Chalmers Research

Chalmers Publication Library

Comparing the hierarchy of author given tags and repository given tags in a large document archive

Author: A. Clauset
A. Trusina
B. Corominas-Murtra
B. Corominas-Murtra
C. Cattuto
C. Cattuto
C. Goessmann
C.V. Damme
D.W. McShea
E. Mones
E. Ravasz
G. Tibély
Gergely Palla
Gergely Tibély
H. Fushing
H. Hirata
H.W. Ma
J. Wickens
M. Kaiser
M. Nagy
P. Pollner
P.R. Krugman
Péter Pollner
R. Guimerà
R. Lambiotte
S. Valverde
Publication venue: 'Springer Science and Business Media LLC'

Crossref